Hi everyone,
Seeking help with the right syntax to examine the collinearity between two categorical variables in STATA.
I would like to include these two variables – mode of delivery (1. Non-labour, 2. Intrapartum and 3. vaginal delivery) and time of placental collection (1. <=30 min, 2. 31 to 60 min, 3. 61 to 90 min and 4. 91 and above) in my multivariate regression model. But, we suspected that there might be collinearity between these two categorical (MOD – nominal variables, and time of collection – ordinal variable).
Since “corr” is only applicable for those variables appear to be continuous, so I wonder if VIF is the right syntax for me to examine the collinearity for those two categorical variables here? I gave it a try of " vce,corr" as well.
And also how do I interpret the results shown below? As I know that 10 has been recommended as the cut off for VIF, so in this case, am I confident to say that there is a weak collinearity between MOD and time of placental collection? (please kindly refer to the bolded values of VIF results derived from Stata below)
Thank you and truly appreciate your assistance in this matter. ; )
Commands I used to perform VIF:
reg z_MI mo_age i.mo_Chinese pp_bmi i.parity_2 i.tobacco_2 GA_weeks i.child_sex i.new_MOD i.time_2 ogtt_2hour
vif
. vif
Variable VIF 1/VIF
mo_age 1.30 0.766470
2.mo_chinese 1.21 0.823911
pp_bmi 1.16 0.864906
2.parity_2 1.24 0.803848
1.tobacco_2 1.08 0.927086
GA_weeks 1.07 0.933429
2.child_sex 1.03 0.972162
new_MOD
2 1.60 0.623646
3 1.57 0.635354
time_2
2 2.29 0.436235
3 1.99 0.501730
4 1.63 0.613657
ogtt_2hour 1.10 0.907737
Mean VIF 1.41
Seeking help with the right syntax to examine the collinearity between two categorical variables in STATA.
I would like to include these two variables – mode of delivery (1. Non-labour, 2. Intrapartum and 3. vaginal delivery) and time of placental collection (1. <=30 min, 2. 31 to 60 min, 3. 61 to 90 min and 4. 91 and above) in my multivariate regression model. But, we suspected that there might be collinearity between these two categorical (MOD – nominal variables, and time of collection – ordinal variable).
Since “corr” is only applicable for those variables appear to be continuous, so I wonder if VIF is the right syntax for me to examine the collinearity for those two categorical variables here? I gave it a try of " vce,corr" as well.
And also how do I interpret the results shown below? As I know that 10 has been recommended as the cut off for VIF, so in this case, am I confident to say that there is a weak collinearity between MOD and time of placental collection? (please kindly refer to the bolded values of VIF results derived from Stata below)
Thank you and truly appreciate your assistance in this matter. ; )
Commands I used to perform VIF:
reg z_MI mo_age i.mo_Chinese pp_bmi i.parity_2 i.tobacco_2 GA_weeks i.child_sex i.new_MOD i.time_2 ogtt_2hour
vif
. vif
Variable VIF 1/VIF
mo_age 1.30 0.766470
2.mo_chinese 1.21 0.823911
pp_bmi 1.16 0.864906
2.parity_2 1.24 0.803848
1.tobacco_2 1.08 0.927086
GA_weeks 1.07 0.933429
2.child_sex 1.03 0.972162
new_MOD
2 1.60 0.623646
3 1.57 0.635354
time_2
2 2.29 0.436235
3 1.99 0.501730
4 1.63 0.613657
ogtt_2hour 1.10 0.907737
Mean VIF 1.41
Comment